Lexical Semantic based Bayesian Model for Adaptive Wrapper Generation
نویسندگان
چکیده
منابع مشابه
Example-Based Wrapper Generation
Extracting specific information from the vast amount of documents in the World Wide Web is a very tedious task. Manual extraction has high quality output but cannot be automated. Programmed wrappers, on the other hand, suffer from the uncertainty of document structures. The generation of a more generic wrapper for whole classes of textual information, which can accommodate all kinds of document...
متن کاملA Model for High-coverage Lexical Semantic Annotation Generation
AI applications often receive their input in the form of natural language text, or as the transcription of spoken text. A commonsense inference system should transform such input to a formal representation with limited vocabulary in order to be able to process them. In this paper, we present a method based on neural word embeddings that automatically assigns semic features to words of natural l...
متن کاملBayesian Discriminant Analysis for Lexical Semantic Tagging
Structuring of terminology automatically is interesting for information extraction or indexing. It remains to evaluate the results obtained. We use a clustering method to build term classes assuming that (even) an incomplete thesaurus could inform the user about the semantic interpretation of classes. A variant naive Bayesian analysis is used to model an association network between terms and ca...
متن کاملPredicting waste generation using Bayesian model averaging
A prognosis model has been developed for solid waste generation from households in Hoi An City, a famous tourist city in Viet Nam. Waste sampling, followed by a questionnaire survey, was carried out to gather data. The Bayesian model average method was used to identify factors significantly associated with waste generation. Multivariate linear regression analysis was then applied to evaluate th...
متن کاملSemaForm: Semantic Wrapper Generation for Querying Deep Web Data Sources
A wealth of data on the World Wide Web is hidden behind web form query interfaces and cannot be found through regular search engines. Querying across multiple such sources is a tedious and error-prone process; it involves manually filling in many related, but different, web forms. SemaForm automates this process by correlating web form labels to entries in a domain ontology through the use of a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Procedia Engineering
سال: 2012
ISSN: 1877-7058
DOI: 10.1016/j.proeng.2012.06.387